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INTRODUCTION 

Nature  of  the  problem 

Breast  cancer  is  a  leading  cause  of  death  in  women,  causing  an  estimated  44,000  deaths  per  year 
(1).  Mammography  is  the  most  effective  method  for  die  early  detection  of  breast  cancer  (2-5)  and  it  has 
been  shown  that  periodic  screening  of  asymptomatic  women  does  reduce  mortality  (6-11).  Various 
medical  organizations  have  recommended  the  use  of  mammograidiic  screening  for  the  early  detection  of 
breast  cancer  (3).  Thus,  mammogr^hy  is  becoming  one  of  the  largest  volume  x-ray  procedures 
routinely  interpreted  by  radiologists. 

It  has  been  reported  that  between  30  to  50%  of  breast  carcinomas  detected  mammogrrqdiically 
demonstrate  clusters  of  microcalcifications  (12-14),  although  about  80%  of  breast  carcinomas  reveal 
microcalcifications  upon  microscopic  examinatimi  (15-18).  In  addition,  studies  indicate  that  26%  of 
mmpalpable  cancers  present  mammographically  as  a  mass  while  18%  {xesent  both  with  a  mass  and 
microcalcifications  (19).  Although  mammogrr^y  is  currently  the  best  method  for  the  detection  of 
breast  cancer,  between  10-30%  of  women  who  have  breast  cancer  and  undergo  mammograidiy  have 
negative  mammograms  (20-24).  In  i^proximately  two-thirds  of  these  false-negative  manunograms,  the 
radiologist  failed  to  detect  die  cancer  that  was  evident  retroqiectively  (23-26).  Low  conspicuity  of  the 
lesion,  eye  fatigue  and  inattoitivaiess  are  possible  causes  for  these  misses.  We  believe  that  the 
effecdvoiess  (early  detection)  and  efficiency  (rafad  diagnosis)  of  screening  procedures  could  be 
increased  substantially  by  use  of  a  computer  system  that  successfully  aids  the  radiologist  by  indicating 
locations  of  suspicious  abncnmalities  in  manunograms. 

Many  breast  cancns  are  detected  and  refinred  ftn  surgical  biopsy  on  die  basis  of  a  radiogrtqdiicaUy 
detected  mass  lesion  or  cluster  of  microcalcifications.  Althou^  general  rules  for  die  differentiatitxi 
betwera  benign  and  malignant  breast  lesions  exist  (20,27),  considerable  misclassification  of  lesions 
occurs  with  the  current  m^hods.  On  avnage,  only  10-30%  of  masses  refmed  fm  surgical  breast 
tnop^  are  actually  malignant  (20,28).  Surgical  biopsy  is  an  invasive  tedmique  dud  is  an  expensive  and 
traumatic  eiqietience  for  die  patioit  and  leaves  physical  scats  that  may  hinder  later  diagnoses  (to  the 
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extent  of  requiring  repeat  bit^es  for  a  radiographic  tumor-simulating  scar).  A  computerized  method 
capable  of  detecting  and  analyzing  the  characteristics  of  benign  and  malignant  masses,  in  an  objective 
maimer,  should  aid  radiologists  by  reducing  the  numbers  of  false-positive  diagnoses  of  malignancies, 
thereby  decreasing  patient  morbidity  as  well  as  the  number  of  surgical  biopsies  performed  and  their 
associated  complications. 

The  development  of  computer  methods  to  assist  radiologists  is  a  timely  project  in  the  sense  that 
digital  radiogrtq)hy  is  on  the  threshold  of  widespread  clinical  use.  The  arrival  of  digital  radiographic 
systems  allows  for  the  acquisition  of  image  data  in  a  format  accessible  to  computerized  schemes.  The 
potential  significance  of  this  research  inoject  lies  in  the  fact  that  if  the  rtetectability  of  cancers  can  be 
increased  by  employing  a  computer  to  aid  the  radiologist's  diagnosis,  then  the  treatment  of  patients  with 
cancer  can  be  initiated  eariier  and  their  chance  of  survival  improved. 

The  systematic  and  gradual  introduction  of  computer-assisted  interpretation  to  radiologists  that  is 
presented  in  this  proposal  is  very  important  in  that  it  allows  for  a  iiKxle  of  presentation  with  minimum 
modification  to  the  current  reading  habits  of  radiologists  and  does  not  require  a  "digital"  department  in 
which  reading  must  be  drme  from  a  CRT  screen.  These  two  issues  are  of  concern  since  (1)  some 
radiologists  are  not  comfrwtable  with  computer-based  methods  and  (2)  primary  diagnosis  from  a  CRT 
display  is  still  controversial.  However,  the  introduction  of  conputer  vision  to  radiologists  presented  in 
this  proposal  is  not  affected  by  either  concern.  In  addition,  when  filmless  image  acquisition  and/or 
digital  (PACS)  radiology  dqiartinents  are  commonplace  in  the  future,  the  conputer-vision  module  can 
be  immediately  interfaced  to  electronic,  filmless  imaging  and  reading  areas. 

Background  of  previous  work 

In  the  1960's  and  70's,  several  investigators  attenpted  to  analyze  mammogrrphic  abnormalities 
with  conputers.  Winsbng  et  al.  (29),  in  an  eaiiy  study,  examined  areas  of  increased  density  in 
crmtralateral  breasts.  They  felt  that  their  results  demcmstrated  the  feasibility  fw  future  computer 
interpretation  of  mammograms.  Spiesbeiger  (30)  developed  various  feature-extraction  techniques  and  a 
two-view  voification  method  involving  medio-lateral  oblique  and  cranio-caudal  views  to  detect 
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miciocaicifications.  Kimme  et  al.  (3 1 )  developed  a  computerized  method  for  the  detection  of  suspicious 
abnormalities  in  mammograms  based  on  the  statistical  measures  of  textural  features.  They  tested  their 
algorithm  on  7  patient  cases.  A  similar  i^ipioach  using  texture  analysis  and  bilateral  comparison  was 
also  e^^}loyed  by  Hand  et  al  (32)  and  Semmlow  (33)  in  the  computerized  localization  of  suspicious 
abnormal  areas  of  breasts.  Their  results  yielded  a  66%  true-positive  rate  with  approximately  26  false 
suspicious  areas  per  image.  With  regard  to  classification  methods,  Ackerman  et  al.  (34),  using  digital 
xeroradiographs,  devised  four  measures  of  malignancy:  calcification,  spiculation,  roughness  and  sha^, 
to  perform  classificatimi  on  specific  areas  selected  by  hurruui  observers.  The  authors  viewed  their 
research  as  only  a  small  step  toward  the  automated  reading  of  xeroradiographs  and  appeared  to 
discontinue  prematurely  their  computer  vision  work.  The  same  group  (35)  did,  however,  attempt  to 
improve  diagnosis  by  using  36  radiogrtqrhic  properties  which  were  evaluated  semi-quantitatively  by  a 
radiologist  for  input  to  a  computer  decision  tree.  Wee  et  al.  (36)  and  Fox  et  al.  (37)  performed 
preliminary  studies  on  the  classification  of  microcalcifications.  These  previous  studies  demonstrated  the 
potential  capability  of  using  a  computer  in  the  detection  of  tiiartmiogri^^c  abnormalities.  Their  results, 
however,  yielded  a  large  number  of  false-positives  and  were  based  on  small  data  sets. 

Conqxiter-aided  diagnosis,  in  general,  has  attracted  little  attention  during  the  last  decade,  perhaps 
due  to  the  inconvenience  involved  in  obtaining  a  radiograph  in  digital  fimnat  Recent  w(»k,  though, 
shows  a  promising  future.  Magnin  et  al.  (38)  and  Caldwell  (39)  used  texture  analysis  to  evaluate  the 
breast's  parenchymal  pattern  as  an  indicator  of  canco^  risk.  These  preliminary  studies  raised  many 
unanswered  questions  regarding  topics  ranging  from  the  digital  recording  process  to  the  type  of 
numerical  risk  coefficient  employed.  Thus,  fiuther  studies  using  texture  analysis  ate  indicated.  The 
work  by  Fam  and  Olson  (40,41)  on  the  conqMiter  analysis  of  mammograms  is  «ic(Miraging;  however, 
dieir  method  has  rmly  been  tested  on  20  mammogrEqphic  re^ons  of  interest  (each  roughly  half  a 
mamnoogram).  Davies  and  Dance  (42)  have  reported  cm  their  autmnatic  method  fix' the  detection  of 
clustered  calcifications  u^g  local  gray-level  diresholding  and  also  a  clustering  rule.  Their  results 
yielded  a  true-positive  rate  of  96%;  however,  no  indications  of  the  subtlety  and  size  of  the  calcifications 
were  given.  Astley  et  al.  (43),  Gritnaud  et  al.  (44)  and  Jin  et  al.  (45)  recently  reprxted  on  their  methods 
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for  the  ^tectitm  of  breast  lesions.  Karssemeijer  (46)  has  J^scribed  a  stochastic  method  based  on 
Bayesitm  decision  theory  that  spears  promising.  Lai  et  al.  (47)  and  Brzakovic  et  al.  (48)  are  also 
developing  techniques  for  the  detection  of  mass  lesions.  Hie  actual  performance  level  and  difficulty  of 
the  databases,  however,  ate  unknown.  Gale  et  al.  (49)  and  Getty  et  al.  (SO)  are  both  developing 
computer-based  classifiers,  which  take  as  input  diagnostically-relevant  features  obtained  from 
radiologists'  readings  of  breast  images.  Getty  et  al.  found  that  with  the  aid  of  the  classifier,  community 
radiologists  performed  as  well  as  unaided  expert  mammognq^rs  in  making  benign-malignant 
decisions.  Swett  et  al.  (51,52)  are  developing  an  expert  system  to  provide  visual  and  cognitive 
feedback  to  the  radiologist  using  a  critiquing  approach  combined  with  an  expert  system.  The  system 
has  been  demonstrated,  though  not  tested. 

We  in  the  Kurt  Rossmarm  Laboratories  for  Radiologic  Image  Research  at  The  University  of 
Chicago  have  vast  experience  in  developing  various  computer-aided  diagnosis  (CAD)  methods  in 
manunography,  chest  radiography,  and  angiography  (53-66).  We  believe  that  our  CAD  methods  in 
digital.mammogrsq)hy,  which  include  the  computerized  detection  of  microcalcifications  and  masses, 
have  achieved  levels  of  sensitivity  and  specificity  that  warrant  testing  in  a  clinical  environment 

Our  detection  scheme  for  clustoed  microcalcifications  includes  a  preprocessing  step  referred  to  as 
a  difference-image  a{qnx)ach  (53,54).  Basically,  the  cxiginal  digital  mammogram  is  spatially  filtered 
twice:  once  to  enhance  the  signal-to-noise  ratios  of  the  microcalcifications  and  a  second  time  to 
suppress  thent  The  difference  between  the  two  resulting  {Hocessed  images  yields  an  image  (a 
difference  image)  in  which  the  variations  in  background  density  ate  largely  removed. 
Microcalcifications  are  then  segnmited  from  die  diffoence  image  using  global  gray-level  thresholding 
and  local  thresholding  techniques.  The  segmented  image  is  next  subjected  to  feature-extraction 
techniques  in  cadn  to  remove  signals  that  likely  arise  from  structures  other  than  microcalcifications. 

An  area  filter  (56),  based  on  mathematical  moi][^logy,  is  used  to  eliminate  stiudl  features.  Next, 
each  region  of  interest  that  ctmtains  remaining  features  is  subjected  to  low-fiequency  background 
correction  and  is  characterized  by  the  first  moment  of  its  power  spectrum,  defined  as  the  weighted 
average  of  radial  spatial  frequency  ova*  the  two-dimensional  power  spectrum  (55).  A  clustering  filter 
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(57)  is  next  used  so  that  only  clusters  that  contain  more  than  a  preselected  number  of  signals  within  a 
region  of  preselected  size  are  retained  by  the  computer.  The  coiiq)uterized  scheme,  using  78 
mammograms  (39  normal  and  39  abnormals)  in  which  most  clusters  were  quite  subtle,  the  scheme 
yielded  a  sensitivity  of  85%  with  tq)proximately  2.5  false-positive  detections  per  image  (58). 

The  computerized  scheme  for  detection  of  clustered  microcalcifications  (55)  developed  at  The 
University  of  Chicago  has  been  tested  as  an  aid  to  radiologic  diagnosis.  Using  a  database  of  60 
clinical  mammograms,  half  of  which  contained  subtle  clusters  of  microcalcifications,  a  human 
observer  study  was  conducted  in  order  to  examine  the  effect  of  the  computer- vision  aid  on 
radiologists'  performance  in  a  situation  that  simulated  rapid  interpretation  of  screening  mammograms. 
The  computer  scheme  attained  an  87%  true-positive  detection  rate  with  an  average  of  four  false¬ 
positive  clusters  per  image.  The  effect  of  the  number  of  false-positive  detections  on  radiologist 
performance  was  also  examined  by  simulating  a  computer  performance  level  of  87%  sensitivity  with 
one  false-positive  detection  per  image.  Radiologist  detection  performance  was  evaluated  using  ROC 
(receiver  operating  characteristic)  methodology  (68).  It  was  found  from  the  ROC  analysis  that  there 
was  a  statistically  significant  improvement  in  the  radiologists'  accuracy  when  they  were  given  the 
computer-generated  diagnostic  information  (at  either  false-positive  level),  con^ared  with  their 
accuracy  obtained  without  the  coo:q)uter  output 

Our  scheme  for  the  detection  of  mammogr^hic  masses  is  based  on  deviations  from  the 
architectural  symmetry  of  normal  right  and  left  breasts,  with  asymmetries  indicating  potential  masses 
(60,61).  The  input  to  the  computerized  scheme,  for  a  given  patient  are  the  four  conventional 
mammograms  obtained  in  a  routine  soeening  examination:  the  right  cranio-caudal  (CQ  view,  die  left 
CC  view,  the  right  medio-lateral-oblique  (MLO)  view,  and  die  left  MLO  view.  Afro:  automatic 
registratirm  of  corresponding  left  and  ri^t  breast  images,  a  nonlinear  subtraction  technique  is 
employed  in  which  gray-level  diresholding  is  performed  (»  die  individual  mammograms  prior  to 
subtraction.  Ten  images  thresholded  widi  different  cutoff  gray  levels  are  obtained  from  the  right 
breast  image,  and  ten  are  obtained  from  the  left  breast  image.  Next,  subtraction  of  the  corresponding 
right  and  left  breast  images  is  poformed  to  generate  ten  bilateral-subtraction  images.  Run-length 
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analysis  is  then  used  to  link  the  data  in  the  various  subtracted  images.  This  linking  process 
accumulates  the  information  from  a  set  of  10  subtraction  images  into  two  images  that  contain  locations 
of  suspected  masses  for  the  left  and  right  breasts.  Next,  feature-extraction  techniques,  which  include 
morphological  filtering  and  analysis  of  size,  shape  and  distance  from  border,  are  used  to  reduce  the 
number  of  false-positive  detections.  Currently,  using  150  pairs  of  clinical  mammograms  (from  75 
cases),  the  approach  achieves  a  true-positive  detection  rate  of  approximately  85%  with  3  to  4  false¬ 
positive  detections  per  image  (62). 

We  have  also  investigated  the  application  of  artificial  neural  networics  to  the  detection  and 
classification  of  mammographic  lesions.  We  used  an  artificial  neural  netwoik  (ANN)  to  extract 
microcalcification  image  data  from  digital  mammograms  (59).  The  ANN,  which  was  supplied  with 
the  power  spectra  of  remaining  suspected  regions  (from  the  CAD  scheme)  as  input,  distinguished 
actual  clustered  microcalcifications  from  false-positive  regions  and  was  able  to  eliminate  many  of  the 
false  positives.  Also,  we  are  applying  ANNs  to  the  decision-making  task  in  mammography  (63). 
Three-layer,  feed-forward  neural  networks  with  a  back-propagation  algorithm  were  trained  for  the 
interpretation  of  mammograms  based  on  features  extracted  from  mammograms  by  experienced 
radiologists.  The  database  for  input  to  the  ANN  consisted  of  features  extracted  from  133  textbook 
cases  and  60  clinical  cases.  Performance  of  the  ANN  was  evaluated  by  ROC  analysis.  In  tests,  using 
43  initial  image  features  (related  to  masses,  microcalcifications  and  secondary  abnormalities)  that  were 
later  reduced  to  14  features,  the  performance  of  the  neural  networic  was  found  to  be  higher  than  the 
average  performance  of  attending  and  resident  radiologists  in  classifying  benign  and  malignant 
lesions.  At  an  optimal  threshold  for  the  ANN  ou^Hit  valiK,  the  ANN  achieved  a  classification 
sensitivity  of  100%  for  malignant  cases  with  a  false-positive  rate  of  only  41%,  whereas  the  average 
radiologist  yielded  a  sensitivity  of  only  89%  with  a  false-positive  rate  for  classification  of  60%. 

We  are  also  developing  computer-aided  methods  for  the  interpretafimi  of  digital  chest 
rariingraphs,  such  as  in  the  detection  of  pulmonary  nodules,  interstitial  infiltrates,  pneumothorax  and 
catdiomegaly  (67,69-75).  The  computer-vision  scheme  for  the  detection  of  lung  nodules  is  based  on 
a  difference-image  approach,  which  (like  the  one  described  above  for  detection  of  clustered 
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microcalcifications)  is  novel  in  that  it  attempts  to  remove  the  structured  anatomic  background  before 
!^lying  feature-extraction  techniques.  After  the  difference  between  the  signal-enhanced  image  and 
the  signal-suppressed  image  is  obtained,  gray-level  thresholding  and  feature-extraction  techniques 
(involving  the  size,  contrast  and  shape  of  the  detected  features)  are  performed  by  the  computer  to 
identify  the  locations  of  possible  nodules.  More  recently,  false-positive  detections  have  been  reduced 
by  adding  nonlinear  filters  to  the  difference-image  step  and  additional  feature-extraction  techniques 
based  on  detailed  analyses  of  the  false  positives. 

The  research  team  in  the  Rossmann  Lab  also  has  considerable  experience  in  evaluation  of  factors 
affecting  image  quality  and  diagnostic  accuracy  in  digital  radiogr^hy.  We  have  investigated  basic 
imaging  properties  including  the  characteristic  system  response,  spatial  resolution  properties  and  noise 
properties  of  various  types  of  digital  radiographic  imaging  systems  (76-86).  The  effects  of  various 
physical  parameters,  such  as  detector  system,  sampling  ^rture,  pixel  size,  number  of  quantization 
levels,  exposure  level  and  display  aperture,  were  examined  at  various  stages  of  the  digital  imaging  chain 
(87-91).  Knowledge  gained  in  this  research  will  be  useful  in  understanding  the  effect  of  spatial 
resolution  and  noise  on  the  performance  of  computer-assisted  interpretation. 

In  developing  methods  for  computer-assisted  interpretations,  it  is  crucial  to  employ  appropriate 
means  for  evaluation.  We  have  carried  out  various  observer  performance  studies  in  comparing  the 
detection  capability  of  new  techniques  both  with  regard  to  simulated  and  clinical  images.  1 8-altemative 
forced-choice  observer  studies  were  enqrloyed  to  examine  the  effect  of  pixel  size  on  the  threshold 
contrast  of  simple  objects  digitally  superimposed  on  uniform  brarkground  noise  (92-94)  and  the  effect  of 
structured  background  on  the  detectability  of  simulated  stenotic  lesions  (95).  In  an  obsover  study  with 
radiologists  using  clinical  images,  ROC  analysis  was  employed  in  order  to  examine  the  effects  of 
different  display  modalities  (film  and  CRT)  on  diagnostic  accuracy  in  digital  chest  radiography  (96). 
Similar  studies  were  pafcmned  to  investigate  die  effect  of  data  compression  ratios  cm  detectability  (97), 
the  conqiarison  of  computed  radiography  with  conventirmal  screen/film  imaging  (98),  and  the  utility  of 
con^MitN-assisted  interpretation  in  mammography  (55)  and  chest  (71).  In  addition,  we  have  used  ROC 
and  FROC  analyses  to  evaluate  the  performance  level  of  the  computerized  schemes  and  the  artificial 
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neural  netw<Mks  (99).  This  broad  experience  will  provide  the  basis  for  developing  similar  methodology 
to  evaluate  the  conqniter- vision  modules  for  mammography  proposed  in  this  application. 

Purpose  of  the  present  work 

The  main  hypothesis  to  be  tested  is  that  given  a  dedicated  computer-vision  module  for  the 
computer-assisted  interpretation  of  mammograms,  the  diagnostic  accuracy  for  mammograhic 
interpretation  will  be  improved,  yielding  earlier  detection  of  breast  cancer  (i.e.,  a  reduction  in  the 
number  of  missed  lesions)  and  a  reduction  in  the  number  of  benign  cases  sent  to  biopsy. 

Computer-aided  diagnosis  (CAD)  can  be  defined  as  a  diagnosis  made  by  a  radiologist  who  takes 
into  consideration  the  results  of  a  computerized  analysis  of  radiogr^hic  images  and  uses  them  as  a 
"second  opinion"  in  detecting  lesions  and  in  making  diagnostic  decisions.  The  final  diagnosis  would 
be  made  by  the  radiologist.  Although  mammogr^hy  is  currently  the  best  method  for  the  detection  of 
breast  cancer,  between  10-30%  of  wtnnen  who  have  breast  cancer  and  undergo  mammography  have 
negative  mammograms  (20-24).  It  has  been  suggested  that  double  reading  (by  two  radiologists)  may 
increase  sensitivity  (100-102).  Thus,  one  aim  of  CAD  is  to  increase  the  efficiency  and  effectiveness 
of  screening  procedures  by  using  a  counter  system,  as  a  "second  opinion  or  second  reading,"  to  aid 
the  radiologist  by  indicating  locations  of  suspicious  abnormalities  in  mammograms. 

If  a  suspicious  region  is  detected  by  a  radiologist,  he  or  she  must  then  visually  extract  various 
radiogr^hic  characteristics.  Using  these  features,  tbs  radiologist  then  decides  if  the  abntHinality  is 
likely  to  be  malignant  or  benign,  and  what  course  of  action  should  be  recommended  (i.e.,  return  to 
screening,  return  for  follow-up  or  send  fw  biopsy).  Many  patients  are  referred  for  surgical  biopsy  on 
the  basis  of  a  radiographically  delected  mass  lesion  or  cluster  of  microcalcificadons.  On  average,  only 
10-20%  of  masses  referred  fw  surgical  breast  bioi»y  are  actually  malignant  (20,28).  Thus,  another 
aim  of  CAD  is  to  extract  and  analyze  the  charactnistics  of  boiign  and  malignant  lesions  in  an  objective 
manno'  in  order  to  aid  the  radirdogist  by  reducing  the  numbers  of  false-positive  diagnoses  of 
malignancies,  thereby  decreasing  pidiait  morbidity  as  well  as  the  number  of  surgical  biopsies 
performed  and  their  associated  complications 
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Methods  of  approach 

The  objective  of  the  proposed  research  is  to  develop  a  dedicated  computer-vision  module  for  use  in 
mammography  in  order  to  increase  the  diagnostic  decision  accuracy  of  radiologists  and  to  aid  in 
mammographic  screening  programs.  The  computer-aided  diagnostic  module  will  incorporate  various 
novel  computer-vision  and  artificial  intelligence  schemes  already  under  development  in  the  Rossmaim 
Laboratories  at  the  University  of  Chicago. 

The  specific  objectives  of  the  research  to  be  addressed  are: 

(1)  Further  development  of  advanced  con^terized  schemes  for  the  detection  and  classification  of 
masses  and  microcalcifications  in  digital  mammograms.  This  part  of  the  research  involves  quantitative 
analysis  of  the  radiographic  characteristics  of  masses  and  microcalcifications,  and  the  decision-making 
processes  used  by  radiologists  in  making  a  decision  with  respect  to  the  likelihood  of  malignancy  and  in 
choosing  the  appropriate  course  of  action. 

(a)  Further  development  of  an  advanced  computerized  detection  scheme  for  masses  that  uses 
bilateral-subtraction  techniques,  gray-level  thresholding,  and  analysis  of  various  image  features. 

(b)  Further  development  of  an  advanced  computerized  detection  scheme  for  microcalcifications 
that  uses  linear  and  nonlinear  spatial  filters,  spectral  content  analysis  and  various  morphological  filters 
for  size,  contrast  and  cluster  analyses. 

(c)  Further  development  of  advanced  conqjuterized  classification  schemes  for  masses  and 
microcalcifications  that  use  computer-vision  techniques  and  artificial-intelligence  techniques  to  calculate 
a  probability  of  malignancy. 

(2)  Envelopment  of  a  dedicated  module  witii  man-machine  interfaces  appropriate  for  the  effective  and 
efficient  use  of  the  CAD  schemes.  Final  diagnostic  decisions  will  remain  with  the  radiologists. 

(a)  Optimization  of  the  CAD  software. 

(b)  Examination  of  various  methods  of  presenting  the  conqHiter’s  results  to  the  radiologist 

(c)  Development  of  a  prototype  intelligent  rtKxiular  workstation  using  a  high-speed  (fast  CPU  & 
large-capacity  memory)  computer  and  a  high-resolution,  filmless  CRT  display. 
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(3)  Evaluation  of  the  efficacy  and  efficiency  of  the  dedicated  computer-vision  module  for 
mammognqihy  using  a  large  clinical  database.  This  part  will  use  both  film  and  filmless  media  for  image 
acquisition  and  display. 


BODY:  Experimental  methods  and  results  to  date 

(1)  Development  of  the  computerized  schemes  for  the  detection  and  classification  of 
masses  and  microcalcifications  in  digital  mammograms. 

Experimental  methods 

The  computerized  schemes  for  detection  and  classification  are  at  various  levels  of  development. 
These  schemes  will  be  used  as  aids  by  radiologists  in  the  interpretation  of  mammograms.  For  the 
development  and  testing  of  these  algorithms,  we  will  collect  500  mammogr^hic  cases  from  the 
Dqiartment  of  Radiology.  Initially,  these  cases  will  include  screen/film  mammograms  that  are  currently 
acquired  in  the  department  Later,  the  database  will  include  digital  images  both  from  coriq)uted 
radiogrrqrhy  (CR)  units  (stimulable  phosfriior)  and  from  a  (XD  array  detector  that  will  be  installed  on 
our  digital  biospy  unit  (see  Section  6  on  Facilities). 

(a)  Development  of  the  computerized  detection  scheme  for  masses. 

The  con^ter-vision  scheme  is  based  on  deviations  from  the  architectural  symmetry  of  normal 
right  and  left  breasts,  with  asymmetries  indicating  potential  masses  (60-62).  Thus,  we  will  continue 
investigating  subtractirxi  techniques  as  a  means  to  increase  the  conspicuity  of  masses  in  mammograms. 
These  techniques  will  be  combined  with  analysis  of  individual  mammograms.  The  input  to  the 
computerized  scheme,  frn*  a  given  patient,  are  the  four  convmtitmal  breast  images  obtained  in  a  routine 
screening  examination:  the  tight  (X  view,  the  left  CC  view,  die  right  MLO  view,  and  the  left  MLO 
view.  Mammograms  will  be  digitized  using  a  laser  scanner  digitizer  (2K  by  2K  matrix).  In  the  initial 
detection  stage,  the  digital  imr^e  can  be  reduced  to  a  512  by  512  matrix  (with  an  effective  pixel  ^  of 
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0.4  mm)  due  to  the  large  size  of  masses  relative  to  tl^  pixel  size.  An  automated  alignment  technique, 
which  we  have  developed,  will  be  used  to  align  corresponding  left  and  right  breast  images  and  also 
images  of  the  same  breast  obtained  over  some  time  period.  The  automated  alignment  of  two 
corresponding  breast  images  will  be  performed  in  three  stages:  image  segmentation,  image  feature 
selection  and  image  registration.  During  image  segmentation,  the  breast  area  will  be  isolated  from  the 
exterior  region  using  a  technique  which  combines  multiple  gray-level  thresholding  and  morphological 
filtering.  With  image-feature  selection,  landmarks  on  each  breast  image  will  be  determined.  These 
landmarks  are  the  breast  border  and  the  nipple  position.  Since  the  image  features  around  the  nipple 
often  include  a  thicker  skin  line  and  greater  subcutaneous  parenchymal  opacity,  a  band  signature  method 
wiU  be  employed  to  identify  the  nipple  position  along  the  breast  border.  During  image  registration, 
translation  and  rotation  of  one  of  the  breast  images  relative  to  the  other  will  be  determined  using  a 
partial-border  matching  technique. 

Once  the  two  images  are  aligned  relative  to  each  other,  the  detection  of  possible  asymmetries 

between  the  border-matched  right  and  left  breast  images  is  achieved  by  correlation  of  the  two 

mammograms,  using  a  bilateral-subtraction  technique.  We  are  investigating  linear  and  nonlinear 

subtraction  methods.  With  linear  subtraction,  the  two  breast  images  are  subtracted  (using  a  left-minus- 

« 

light  convention)  and  then  gray-level  thresholding  is  performed  in  (»der  to  segment  the  image  into 
possible  locations  of  suspect  masses.  With  the  nonlinear  technique,  gray-level  thresholding  is 
performed  prior  to  subtraction.  This  initial  thresholding  eliminates  some  normal  anatomic  background 
from  further  analysis.  A  selected  number  of  images  thresholded  with  different  cutoff  gray  levels  is 
obtained  from  the  right  breast  image,  and  a  corresponding  number  is  obtained  from  the  left  breast 
image.  Subtraction  of  ten  sets  of  corresponding  right  and  left  Ixeast  images,  each  thresholded  at  ten 
different  levels,  u  performed  to  genoiate  ten  bilateral-subtraction  images  (containing  infrvmation  on 
suspicious  masses  in  the  two  original  mammograms).  A  linking  process  then  accumulates  the 
information  into  two  images,  called  runloigth  images,  whoe  the  value  of  each  pixel  in  each  image 
indicates  how  often  the  corresponding  location  in  die  set  of  10  subtraction  images  has  gray  levels  above 
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or  below  a  particular  cutoff  gray  value.  These  images  are  next  tliiesholded  to  yield  the  suspicious  areas 
and  submitted  fw  feature  extraction. 

Feature^traction  techniques  will  be  performed  on  both  the  runlength  images  and  the  original 
matiunograms  to  reduce  the  number  of  false-positive  detections.  Initially,  a  morphological  closing 
(^ration  followed  by  a  opening  operation  will  be  used  to  eliminate  isolated  pixels  and  merge  small 
neighboring  features.  Next  a  size  test  will  be  used  to  eliminate  features  that  are  smaller  than  a 
(Medetermined  cutoff  size.  A  border  test  will  be  used  to  eliminate  artifact  features  arising  from  any 
border  misalignment  that  occurred  during  digitizati<m  and  registration.  On  the  original  images, 
su^)ected  regions  will  be  subjected  to  regicm-growing  techniques  and  then  examined  with  respect  to 
size,  shape  and  contrast,  in  order  to  eliminate  features  of  elongated  shape  and  diffuse  coimective  tissue. 

Further  analysis  will  be  performed  by  correlating  geometrically  the  information  (Stained  from  the 
(X  mammographic  pair  and  the  MLO  pair.  Since  the  two  views  are  obtained  from  the  same  breast 
image,  the  appearance  of  a  mass  in  one  view  of  a  breast  will  be  expected  to  exist  in  the  other  view  of  the 
same  breast.  This  geometric  correlation  will  need  to  take  into  account  the  '*l‘^erent  angles  of  projection 
of  the  3-dimensional  Ineast  in  forming  the  two  2-dimensi(mal  images  and  the  possibly  different  amounts 
of  physical  compression  iq)plied  to  the  breast  in  question  during  acquisition  of  the  two  views. 

In  addition  to  conqraring  the  tight  and  left  Ineast  images  of  a  given  view  obtained  at  a  given  time, 
comparisons  will  be  made  between  images  of  the  same  ineast  obtained  at  die  same  projection  but  at 
different  times  in  (nder  to  note  changes  in  the  breast  This  follows  the  mediodology  enqiloyed  by 
mammogrEqdiets  when  interpreting  a  case  widi  previous  examinations  availabte.  Similar  subtraction 
techniques  and  feature^xtraction  mediods  will  be  employed.  Use  of  histogram  specification  methods 
(103),  however,  may  be  necessary  in  order  to  match  the  gray-level  distributions  of  the  two  images  (that 
woe  obtained  at  diffemit  times)  v^ien  there  exists  a  large  variation  in  die  eiqxisuie  techniques 
employed. 
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Results  to  date 

Currently,  the  scheme  employs  two  pairs  of  conventional  screen-film  mammograms  (the  right 
and  left  MLO  views  and  CC  views),  which  are  digitized.  After  the  right  and  left  breast  images  in  each 
pair  are  aligned,  a  nonlinear  bilateral-subtraction  technique  is  employed  that  involves  linking  multiple 
subtracted  images  to  locate  initial  candidate  masses.  Various  features  ate  then  extracted  and  merged 
using  an  artificial  neural  network  in  order  to  reduce  false-positive  detections  resulting  from  the 
bilateral  subtraction. 

The  features  extracted  from  each  suspected  mass  lesion  include  geometric  measures,  gradient- 
based  measures  and  intensity-based  measures.  The  geometric  measures  are  lesion  size,  lesi 
circularity,  margin  irregularity,  and  lesion  compactness.  The  gradient-based  measures  are  the  ^ .  crage 
gradient  (based  on  a  3  by  3  Sobel  operator)  and  its  standard  deviation  calculated  within  the  specified 
region  of  interest  The  intensity-based  measures  are  local  coiUrast  average  gray  value,  standard 
deviation  of  the  gray  values  within  the  lesion,  and  the  ratio  of  the  average  to  the  standard  deviatiim. 
The  features  were  normalized  between  0  and  1  and  input  to  the  a  back-propagation,  feed-forward 
neural  netwc^.  The  ANN'S  structure  consisted  of  10  input  units,  one  hidden  layer  with  7  hidden 
units  and  one  output  unit.  In  this  task,  the  output  unit  ranged  from  0  to  1,  where  1  corresponded  to 
the  suspected  lesion  being  an  actual  mass  (i.e.,  a  true-positive  detection)  and  0  corresponded  to  the 
suspected  lesion  being  a  false-posiitve  detection  (and  thus,  allowed  to  be  eliminated  as  a  suspect 
lesion-candidate).  Based  on  the  performances  of  the  ANN  as  a  function  of  iteration,  in  terms  of  self- 
consistency  and  round  robin  analyses,  the  optimal  number  of  trairring  iterations  was  determined. 

ROC  (receiver  opnating  characteristic)  analysis  was  q^lied  to  evaluate  the  output  of  tbc  ANN  in 
tNins  of  its  ability  to  distinguish  between  actual  mass  lesions  and  false-{)ositive  detections.  The 
ouqxit  values  from  the  ANN  for  actual  masses  and  for  false-positive  detections  were  used  in  the  ROC 
analysis  as  die  decision  variable.  Basically,  the  ROC  curve  represents  the  true-positive  fraction  and 
the  false-positive  fraction  at  various  thresholds  of  the  ANN  ouqxit  ROC  analysis  was  used  a  an 
index  of  perftmnance  in  determining  the  "qitimal”  numbor  of  input  features,  the  "optimal”  number  of 
hidrten  units,  and  the  "optimal"  nuinber  of  training  iterations  of  the  ANN. 
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In  the  self-consistency  analysis,  the  ANN  achieved  an  Az  of  1.0  and  in  the  round-robin  analysis, 
the  ANN  achieved  an  Az  of  0.92  in  distinguishing  actual  masses  (true  positives)  from  false-positive 
detectios.  In  an  evaluation  study  using  the  154  pairs  of  clinical  mammograms  (90  pairs  with  masses 
and  64  pairs  without),  the  detection  scheme  yielded  a  sensitivity  of  95%  at  an  average  of  2,5 
false-positive  detections  per  image.  This  is  a  substantial  improvement  from  the  previous 
year’s  performance  of  85%  sensitivity  and  4  false-positive  detections  per  image. 

(b)  Development  of  the  computerized  detection  scheme  for  microcalciflcations. 

Microcalcifications  are  a  primary  indicator  of  cancer  and  ate  often  visible  in  the  mammogram  before 
a  palpable  tumor  can  be  detected.  Initially,  clinical  screen/film  mammograms  will  be  digitized  using  the 
laser  scanner  and  analyzed  in  the  2048  by  2048  matrix  format  in  order  to  retain  the  high  spatial- 
frequency  content  of  the  microcalcifications.  First,  the  original  mammograms  will  be  processed  to 
enhance  and  suppress  the  signal  of  the  microcalciflcations,  followed  by  calculation  of  a  difference 
image.  Both  linear  and  nonlinear  filters  will  be  investigated  for  enhancement  and  suppression. 

Previous  use  of  both  linear  and  nonlinear  filters  in  detecting  lung  nodules  in  digital  chest  images  has 
shown  that  while  both  types  of  filters  tended  to  detect  nodules,  locations  of  false  positives  differed. 
Thus,  a  combination  of  the  results  from  each  processing  technique  has  the  potential  to  yield  high 
sensitivity  and  reduce  the  number  of  false-positive  detections.  Exan^iles  of  filters  for  signal 
enhancement  include  a  linear  "matched"  filter  that  matches  the  profile  of  a  typical  microcalcification  and 
a  morphological  open  filter  (to  enlarge  the  rqipearance  of  microcalcifications).  Morphological  filtering 
(104)  is  basically  a  nonlinear  filtering  method  that  calculates  the  logical  AND  (erosion  function)  or  OR 
(dilation  function)  of  pixels  within  a  kernel  of  some  given  size  and  shqie.  When  extended  to  gray-scale 
images,  the  logical  AND  and  OR  iterations  can  be  ritlaced  by  minimum  and  maximum  operations.  By 
apjxopriately  choosing  the  size  and  shiqte  of  the  kernels,  as  well  as  the  sequences  of  the  AND  and  the 
OR,  the  filters  can  eliminate  groups  of  pixels  of  limited  size  cm*  merge  neighboring  pixels.  Exanqtles  of 
filters  for  signal  sui>ression  include  ring-shaped  filters  that  yield  either  the  average  or  median  value  of 
the  surrounding  normal  anatomic  background  (54). 
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The  difference  image  will  then  be  subjected  to  various  feature-extraction  techniques  to  reduce 
further  the  number  of  false-positive  detections.  These  techniques  will  test  fm*  size,  contrast  and  spectral 
content  of  neighboring  features.  New  methods  for  analyzing  dMse  features  will  involve  the  use  of 
morphological  filters.  For  example,  we  have  found  that  the  use  of  asymmetric  mmi^logical  filters  to 
eliminate  features  less  than  3  pixels  in  size  are  more  effective  and  efficient  than  use  of  a  point-by-point 
analysis  that  involves  counting  the  number  of  pixels  in  each  remaining  feature  and  comparing  it  to  a  size 
cutoff.  In  addition,  the  presence  of  clustering  of  the  microcalcifications  will  be  examined  since  singular 
microcalcifications  are  usually  not  cancerous.  The  morj^logical  kernel  for  the  clustering  test  will 
correspond  to  the  size  of  a  typical  cluster  (api»oximately  6  mm  in  diameter). 

Results  to  date 

Tlie  microcalcification  detection  scheme  consists  of  diree  steps.  First,  the  image  is  filtered  so  that 
the  signal-to-noise  ratio  of  microcalcifications  is  increased  by  sui^nession  of  the  normal  background 
structure  of  the  breast  Second,  potmtial  microcalcifications  are  extracted  from  the  filtered  image  with 
a  series  of  three  different  techniques:  a  global  thresholding  based  on  the  grey-level  histogram  of  the 
full  filtered  image,  an  erosion  operator  fw  eliminating  v«y  small  signals,  and  a  local  ad{q)tive  grey- 
level  thresholding.  Third,  some  false-positive  signals  are  eliminated  by  means  of  a  texture  analysis 
technique,  and  a  nonlinear  clustering  alg(»ithm  is  then  used  for  groiqnng  the  remaining  signals. 

In  our  ctHiqMita:  detecticm  scheme  it  is  neccesary  to  group  or  cluster  microcalcifications,  since 
clustered  microcalcifications  are  nKxe  clinically  significant  dian  are  isolated  microcalcifications.  In  the 
past  we  used  a  "growing”  technique  in  which  signals  (possible  microcalcifications)  woe  clustered  by 
grouping  those  that  were  within  some  predefined  distance  from  the  center  of  tile  growing  cluster.  In 
tiiis  paper,  we  introduce  a  new  technique  for  grouping  signals,  which  ctmsists  of  two  steps.  First, 
signals  tiiat  may  be  several  juxels  in  area  are  reduced  to  single  pixels  by  means  of  a  recursive 
transformation.  Second,  the  numbo*  of  signals  (non-zeo  pixels)  within  a  small  region,  typically 
3.2x3.2  nun,  are  counted.  Only  if  tiuee  tu  more  signals  are  present  within  such  a  region  are  they 
preserved  in  tile  outyut  image.  In  this  way,  isolated  signals  are  eliminated.  Furthermore,  this  method 
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can  diming  foisdy  detected  clusters,  which  were  identified  by  our  previous  detection  scheme,  based 
on  the  spatial  distribution  of  signals  within  the  cluster.  The  differences  in  performance  of  our  CAD 
scheme  fw  detecting  clustered  microcalcifications  using  the  old  and  new  clustering  techniques  was 
measured  using  78  mammograms,  containing  41  clusters.  The  new  clustering  technique  inqiroved 
our  detecti(Hi  scheme  by  reducing  the  false-positive  detection  rate  while  maintaining  a  sensitivity  of 
approximately  85%. 

We  also  ai^lied  artificial  neural  networks  to  the  differentiation  of  actual  "true"  clusters  of 
microcalcificatirms  from  normal  parenchymal  patterns  and  from  false  positive  detections  as  reprxted  by 
a  con^teiized  scheme.  The  differentiation  was  carried  out  in  both  the  spatial  and  spatial  frequency 
domains.  In  the  spatial  domain,  the  performance  of  the  neural  networks  was  evaluated  quantitatively  by 
means  of  RCK!  analysis.  We  found  that  the  networks  could  distinguish  clustered  microcalcifications 
from  normal  nonclustered  areas  in  the  frequency  domain,  and  that  they  could  eliminate  approximately 
50%  of  false-positive  clusters  of  microcalcificadons  while  [xeserving  95%  of  the  positive  clusters. 

The  number  of  false-positive  detections  was  even  further  reduced  when  a  shift-invariant  artificial 
iMural  netw(»k  (SIANN)  was  used  to  analyze  the  remaining  suspected  locations.  TheSIANNisa 
multilayer  back-propagation  neural  network  with  local,  shift-invariant  interconnections.  The  advantage 
of  the  SIANN  is  that  the  result  of  the  network  is  rxH  dqiendent  on  the  locations  of  the  clustered 
microcalcifications  in  the  input  layer.  The  petf<»mance  of  the  SIANN  was  evaluated  by  means  of  a 
jack-knife  method  and  RtX  analysis  using  a  database  of  168  regions  as  reported  by  the  CAD  schnne. 
Approximately  55%  of  die  false  positives  were  eliminated  without  loss  of  any  of  the  true-positive 
detections.  We  also  examined  the  effect  of  the  network  structure  on  the  performance  of  die  SIANN. 
These  techniques  led  to  a  performance  of  85%  sensitivity  with  approximately  0.7 
false-positive  detections  per  image. 

(c)  Development  of  computerized  classification  schemes. 

Various  feature-extraction  techniques  and  artificial  intelligenoe  schemes  will  be  investigated  in  order 
to  distinguish  malignant  masses  and/or  microcalcifications  from  benign  masses  and/w 
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micfocalcificiRions.  The  database  for  this  investigaticm  will  be  obtained  from  the  conventional  four 
screening  breast  images,  as  well  as  special  views  such  as  spot  conqxession. 

In  our  previous  work,  we  compiled  a  list  of  features  that  radiologists  use  in  distinguishing  between 
malignant  and  benign  masses.  These  features  include:  margin  spiculation  (number  of  spiculations, 
loigth  of  spiculation,  and  difference  between  spicules  and  local  linear  features),  shape  (linear  to 
sfdietical,  geom^rical  to  diffuse,  and  existence  of  satellite  lesions),  size  (mean  diameter),  margin 
characteristics  (cmnplete  to  inseparable  from  surround,  well-defined  to  indistinct,  and  presence  of  halo 
sign),  and  pattern  of  interior  (uniformity,  {xesence  of  well-defined  lucencies,  and  opacity  relative  to 
size).  The  analysis  of  spiculation  will  be  based  <»  a  novel  computer-vision  method  involving  the 
Fourier  analysis  of  the  fluctuations  around  the  ma^in  of  the  mass  in  question  (60).  The  conqNita*- 
extracted  margin  used  in  the  analysis  for  spiculation  also  crmtains  information  related  to  tiie  number  and 
length  of  spiculatirms.  Also,  prior  to  die  analysis  of  spiculatirm,  die  mass  is  extracted  from  die  normal 
anatomic  background  of  the  Ixeast  purachyma.  Currentiy,  regicm-growing  techniques  are  employed 
for  this  extraction.  Once  extracted,  the  shape  and  size  of  the  mass  can  be  easily  calculated.  The  size 
will  be  defined  as  die  effective  diameter  of  a  circle  that  has  the  same  area  as  the  extracted  mass.  The 
shape  will  be  expressed  by  a  degree  of  circularity,  which  will  be  defined  as  the  ratio  of  die  area  of  the 
mass  within  the  equivalent  circle  to  the  total  area  of  the  mass.  Masses  with  ill-defined  margins  are  more 
likely  to  be  malignant  than  diose  widi  relatively  well-rfefined  margiiis.  Thus,  a  margin  gradirat  test  will 
be  developed  to  measure  the  sharpness  of  the  margiiL  This  sharpness  will  be  defined  as  die  degree  of 
d«isify  change  aooss  the  margin  and  will  be  measured  perpendicular  to  the  margin  at  all  points  along 
diematgin.  The  patlem  of  the  interior  will  be  quantitatively  determined  from  die  spectral  content  of  the 
intaicM'. 

Features  related  to  the  classification  <rf  microcalcifications  include:  the  shape  of  die  individual 
microcalcifications  (rounded  to  iiv^ular,  linear,  and  Inanched),  uniformity  of  microcaldficatioos  widiin 
acluster  (uniformity  in  size,  shqie,  and  density),  distribution  of  the  microcalcifications  (diffosoiess 
and  shrqie  of  cluster)  and  presmee  of  macrocalci&aticMis.  The  size  and  riuqie  of  the  individual 
miaocalcifications  will  be  ddermined  by  the  onnputa’  using  an  effective  diameter  and  a  circularity 
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measuie,  respectively,  as  described  earlier.  Uniformity  within  a  cluster  will  be  assessed  by  calculating 
the  spread  of  values  for  a  particular  characteristic  such  as  size.  Once  a  cluster  has  been  defined,  its 
diffoseness  will  be  given  by  the  number  of  microcalcifications  per  unit  area  and  its  shiqK  will  be  defined 
using  a  circularity  measure. 

These  various  computer-determined  quantitative  measures  describing  the  mass  or  cluster  of 
microcalcifications  in  que^n  will  be  input  to  an  artificial  neural  n^ork  that  will  merge  the  features 
into  a  ptobid>ility  of  malignancy  for  use  by  radiologists.  As  mentioned  in  the  Background  section,  our 
work  with  a  neural  network  in  merging  human-rqxnled  mammographic  features  into  a 
nudignant/benign  decision  has  been  extremely  {xomising.  The  iiqput  data  (corresponding  to  the 
conqxiter-extracted  features  of  the  masses  and  microcalcificatitHis)  will  be  represented  by  numbers 
ranging  from  0  to  1  and  will  be  supplied  to  the  input  units  of  the  neural  network.  The  output  data  from 
the  neural  network  is  then  provided  from  output  units  through  two  successive  nonlinear  calculations  in 
die  hidden  and  output  layers.  The  calculation  at  each  unit  in  a  layer  includes  a  weighted  summation  of 
all  entry  numbers,  an  addition  of  a  certain  offset  number,  and  a  conversion  into  a  number  ranging  from 
0  to  1  using  a  sigmoid-shtqie  function  such  as  a  logistic  functitm.  Two  different  basic  processes  are 
involved  in  a  neural  network;  namely,  a  training  process  and  a  testing  process.  The  neural  network  will 
be  trained  by  a  back-fniqiagation  algorithm  (lOS)  using  input  data  (i.e.,  con^Hito’-repcxted  features)  and 
the  desired  corresponding  ou^t  data  (i.e.,  biopsy  or  follow-up  proven  truth  of  the  malignant  or  benign 
status  of  the  mass  or  microcalcifications  in  question),  for  a  variety  of  cases.  Once  trained,  the  noual 
network  will  accept  computer-reported  features  of  the  mass  or  microcalcifications  in  questirm  and  ou^t 
a  value  from  0  to  1  whoe  0  is  definitely  benign  and  1  is  definitely  malignant.  Based  on  die  distrilnitimi 
of  these  values  for  various  known  cases,  we  will  be  able  to  detramine  what  course  of  action  (e.g., 
biqsy,  follow-up  or  return  to  ncmnal  screening)  should  be  recommended  to  the  radiologisL 
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Results  to  date 
Ciassiflcation  of  masses 

Our  earlier  work  showed  that  a  back-propag^on,  feed-forward  artificial  neural  network  could 
merge  human-extracted  features  of  mammographic  tesions  into  a  likelihood  of  maliganncy  at  a  similar 
level  of  that  of  an  expert  manunogrs^her.  In  the  study  presented  here,  however,  ANN  is  used  to 
merge  computer-extracted  features  of  mass  lesions  into  a  likelihood  of  malignancy. 

The  method  takes  as  input  the  center  location  of  a  mass  lesion  in  question.  Next,  the  lesion  is 
segmented  from  the  breast  parenchyma  (background)  using  an  automatic  region  growing  technique 
and  various  features  of  the  lesion  are  extracted.  The  automatic  lesion  segmentation  involves  the 
analysis  of  the  size  of  the  grown  region  as  a  function  of  the  gray-level  interval  used  for  the  region 
growing.  Many  of  the  extracted  features  are  determined  from  a  cumulative  edge-gradient-tmentation 
histogram  analysis  modified  for  orientation  relative  to  a  radial  angle.  Input  to  an  ANN  consists  of 
four  features  from  the  gradient  analysis  along  with  the  average  gray  value  within  the  grown  lesion. 
The  gradient  measures  include  the  FWHM  (fiill  width  at  half  max)  of  the  cumulative  edge-gradient- 
orientation  histogram  calculated  from  pixels  within  the  lesimi  and  its  neighboring  surround,  and  from 
just  pixels  along  the  lesion  margin  (see  Figures  1-3).  These  measures  correspond  to  the  presence  of 
spiculation,  which  is  a  sign  of  malignancy  in  the  visual  interpretation  of  mammogta{riiic  masses.  The 
ANN’S  structure  consisted  of  5  input  units,  one  hidden  layer  with  4  hidden  units  and  one  output  unit, 
hi  this  task,  the  output  unit  tanged  from  0  to  1,  where  1  ctnresponded  to  the  lesion  being  malignant 
and  0  ctmespraided  to  the  lesion  being  benign.  Use  of  ROC  analysis  with  self-cmsistency  testing  and 
round-ibbin  testing  was  enqiloyed  as  discussed  in  the  jnevious  section. 

The  classification  tn^hod  was  evaluated  using  a  pathologically-confirmed  database  of  95  masses 
(57  malignant  and  38  bmign),  of  which  all  but  one  had  been  sent  to  biospy.  The  nuumnograms  in  the 
database  had  bera  digitized  to  a  pixel  size  of  0.1  mnL  Using  the  five  input  features,  an  Az  (area  under 
to  the  ROC  curve)  of  0.83  was  obtained  in  the  task  of  distinguishing  benign  fhnn  malignant  masses 
using  a  round-robin  method  for  evaluation.  However,  we  found  that  by  using  a  rule-based  decision 
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(Ml  one  of  the  features  (FWHM)  based  on  its  correspondence  to  visual  interpretation  methcxls,  prior  to 
use  of  the  ANN,  tte  performance  increased  yielding  an  Az  of  0.90  for  distinguLshing  between 
noialigiiaiit  and  benign  masses. 

Results  to  date 

Classification  of  microcalcifications 

The  analysis  of  microcalcifications  can  be  difficult  to  perform  consistently  for  human  observers 
leading  to  the  p(X)r  positive  predictive  value.  We  have  been  investigating  methcxls  to  identify 
computer-extracted  quantitative  features  of  microcalcifications  and  their  clusters  Oiat  can  be  used  to 
classify  malignant  and  benign  clustered  microcalcifications,  and,  to  exam  if  a  computer  can  make 
accurate  differential  diagnoses  based  on  conqiuter-extracted  features.  In  this  study,  features  of  the 
microcalcifications  and  their  clusters  were  automatically  extracted  from  digitized  conventional 
mammograms 

The  microcalcificati(Mis  were  segmented  using  the  following  method,  vdiich  is  described  in 
detail  elsewhere.  A  third-degree  polynomial  was  fitted  to  the  pixel-value  distribution  in  a  ROI  (region 
of  interest)  of  the  digitized  mammogram  in  both  hcaizontal  and  vertical  directions  to  reduce  the 
background  structure  of  the  Ineast  parenchyma.  The  microcalcification  was  thoi  delineated  by  region 
growing.  The  effective  thickness  of  the  microcalcificat!:on  (physical  dimension  along  x-ray  projection 
line)  was  estimated  fiom  signal  contrast  (mean  pixel  value  above  background)  of  the  isolated 
miciocalcifi(Miti(Mi.  This  was  done  by  first  converting  signal  contrast  in  terms  of  optical  density  to 
(xmtrast  in  terms  of  exposure  using  knowledge  of  the  H&D  curve  of  the  screen-film  system,  and 
secondly  ccmverting  crmtrast  in  toms  of  exposure  to  physical  dimension  using  die  exponential 
attenuation  law  assuming  a  "standard"  model  of  the  breast  and  the  microcalcificatkm.  The  standard 
model  assumes  (i)  a  4-cm  compressed  breast  composed  of  50%  adipose  and  50%  glandular  tissues; 
(ii)  a  mkaocalcification  composed  of  calcium  hydroxy^iatite  with  pl^sical  density  of  3.06  g/mm^; 
and  (iii)  a  20-keV  mon(x:hromatic  x-ray  beam.  Two  contrast  corrections  were  iqiplied  for  better 
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accuracy:  conqmisatioii  for  bhining  caused  by  the  screen-film  system  and  the  digitization  process, 
and  conqiensatitHi  for  x-ray  scatter. 

The  usefulness  o^die  features  were  evaluated  using  the  distributions  of  the  benign  and  malignant 
populations.  Features  capaUe  of  showing  separation  between  benign  clusters  from  the  malignant 
po{»ilation  were  chosoi  for  the  automated  classification.  Extracted  features  were  based  on  the  size, 
shape,  contrast,  and  uniformity  of  individual  microcalcificatimis;  and  the  size  and  sh^  of 
microcalcification  clusters.  An  artificial  neural  network  was  used  to  classify  benign  versus  malignant 
clusters  of  microcalcificatioos  using  8  computer-extrra:ted  features.  The  database  consisted  of  100 
images,  digitized  at  100-mm  pixel  size  and  10-bit  grey-scale  resolution,  from  53  patients  biopsied  for 
suspicion  of  breast  cancer  based  on  clustered  microcalcifications.  The  neural  network  correctly 
identified  47%  of  the  benign  patients,  all  of  whom  had  biopsies,  and  100%  of  the  maligrrant  patients. 
We  conclude  that  a  computer  is  capable  of  distinguishing  benign  from  nudignant  clust^ed 
microcalcifications  even  at  100-mm  pixel  size. 

(2)  Development  of  a  dedicated  CAD  module  for  use  by  radiologists. 

Experimental  methods 

The  various  conqwter-visitHi  and  artificial  intelligence  schen^  will  be  incmporated  into  a  dedicated 
computer  system  (module)  equipped  with  a  high-speed  conputer  and  a  digital  image  interface,  as 
shown  in  Figure  4.  The  digital  image  interface  will  initially  be  to  a  film  digitizo’  in  (wder  to  test  the 
CAD  schones  using  the  large  database  of  clinical  mammograms  available  in  our  Radiology  Dq>artm«it 
Later,  mammogrtphic  images  will  be  obtained  using  the  CR  system  tx  the  CCI>-based  digital  bitpsy 
unit  The  intelligent  modular  wt^cstation  will  need  to  have  sufficimt  conputer  power  (CPU  and  large 
capacity  memrxy)  and  display  capabilities  to  allow  for  "real-time"  cmnputation  and  viewing  of  die 
conputn'-vision  results.  Thus,  we  plan  to  upgrade  our  current  conputer  hardware  and  tptimize  our 
software  to  achieve  high-^)eed  and  efficient  conputadon  of  CAD  results.  Our  tar;^  is  to  reduce  the 
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CPU  time  required  for  CAD  conqNitations  from  the  current  level  of  about  S  minutes  per  image  to  a  few 
seconds.  Also,  appropriate  man-machine  interfaces  will  be  needed  for  effective  and  efficient  computer- 
assisted  interpretations.  This  part  of  the  research  will  involve  the  examination  of  various  methods  of 
presenting  the  computer-determined  results  to  the  radiologists.  Important  parameters  include  (a)  the 
sluq)e  and  size  of  tl»  markers  of  the  computer  output  that  could  represent  the  severity  or  confidence 
level  (probability)  of  the  lesion,  (b)  the  optimal  operating  point  of  the  CAD  schemes  (high  sensitivity 
with  an  acceptable  number  of  false  positives),  (c)  tlK  timing  and  duration  of  displaying  the  computer 
output,  (d)  the  selection  of  the  minimum  number  of  inputs  required  for  radiologists  and  (e)  the  user- 
Mendliness  of  instructions  and  input  entries. 


Hgure  4.  The  intelligent  modular  workstation.  The  distal  image  interface  will  be  the  film  digitizer  and  also  direct 
digital  imaging  devices  such  as  die  CR  system  and  the  digital  biopsy  unit  The  CAD  module  requires  a  high-speed  CPU 
and  large  capacity  memory.  Hardcopy  devices  include  the  laser  film  printer  and  the  economical  diemial-paper  image 
printer.  Optional  links  to  RIS  (radiology  information  system)  and  PACS  (picture  archival  and  communication  tystem)  are 
included. 

The  development  of  the  prototype  modular  system  wiU  be  achieved  in  stages.  In  Phase  1,  the 
introductitm  of  the  con^ta*- vision  aid  to  the  radiologists  will  be  inqilemented  with  minimum  change  in 
the  current  radiologist  method  of  operating.  This  will  allow  fcv  a  gradual  introduction  in  order  to 
minimize  any  resistance  to  change.  Thus,  only  computer-reported  detection  results  will  be  presented  to 
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the  radiologist,  leaving  all  of  the  interpretation  to  the  radiologist.  Basically,  the  computer  will  serve  as  a 
"second  opinion"  indicating  suspicious  areas  without  critique  as  to  their  degree  of  malignancy.  Original 
films  will  be  digitized  (2048  by  2048  digitization  matrix)  and  analyzed,  with  the  conqmter  output  then 
printed  on  either  film  or  thermal  piqier.  Radiologists  will  perform  their  normal  reading  using  the 
original  image  and  the  computer  results.  It  is  believed  that  this  introduction  of  CAD  to  radiologists  will 
cause  minimum  modification  to  their  normal  reading  patterns,  thus  allowing  for  a  smooth  and  effective 
transition.  During  Phase  2,  results  from  the  classification  schemes  also  will  be  included,  using  the 
methodology  described  for  Phase  1.  Howver,  in  this  second  phase  the  computer  will  serve  as  a 
"second  opinion"  for  both  the  location  and  the  interpretation  of  breast  lesions. 

During  the  first  two  phases,  we  will  investigate  the  best  markers  for  use  by  radiologists,  who  may 
prefer  arrows  or  circles  (icon-type  symbols).  It  should  be  noted  that  the  implementation  of  computer 
vision  in  niammognq)hic  screening  using  the  methods  described  above  is  not  limited  to  fully  digital 
(PACS)  departments  but  can  be  incorporated  in  a  general  film-based  radiology  dq>artment  or  in  a 
mobile,  filmless  manunography  unit  (i.e.,  a  limited  PACS  environment). 

Once  the  use  of  computer  vision  is  shown  to  be  useful,  beneficial  and  efficient,  we  will  incorporate 
high-resolution,  state-of-the-art  monitors  into  the  dedicated  computer  system  (I%ase  3)  as  shown  in 
Figure  1.  The  "intelligent"  module  will  be  interfaced  to  our  department's  RIS  (radiology  information 
system)  to  link  the  demogr3q>hic  and  medical  history  infcnmation  with  the  CAD  output  In  order  for  the 
radiologist  to  examine  the  entire  Ineast  image,  the  display  monitor  will  need  to  have  2K  by  2K 
capability.  In  mainmogrrq)hy,  each  breast  image  usually  can  be  digitized  adequately  into  a  2K  by  IK 
image.  Thus,  in  order  to  view  all  four  breast  images  (left  and  right  CC  views  and  left  and  right  MLO 
views),  two  high-resolution  2K  by  2K  monitors  are  needed.  However,  in  the  practice  of  radiology, 
films  (images)  fix>m  previous  examinations  play  an  imptxtant  role  in  the  current  exam  due  to  the  need 
for  coiiq>arison  in  mtler  to  detect  subtle  changes.  Thus,  the  di^lay  requiremrats  are  four  2K  by  2K 
monitors  (in  a  2  by  2  arrangement),  allowing  the  top  two  mcMiitors  to  be  used  for  sequencing  through 
previous  exams  of  the  patient  in  question.  In  this  phase,  the  radiologists  will  do  dieir  reading  of  tin 
mammographic  cases  from  the  high-resolution  monittvs.  Due  to  the  dynamic  nature  of  the  display,  the 
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computer-reported  results  can  be  presented  in  a  toggle  format  where  the  radiologist  can  press  a  button  to 
either  show  or  remove  the  computer-reported  results.  In  addition,  the  computerized  schemes  can  be 
configured  to  allow  for  the  radiologist  to  control  the  tradeoff  between  the  sensitivity  and  specificity  of 
the  computer  output,  because  more  true-positive  detections  always  can  be  achieved  at  the  cost  of  a  larger 
number  of  false-positive  findings,  and  vice  versa.  This  tradeoff  would  be  adjusted  by  the  radiologist, 
depending  on  the  nature  of  the  case  material  and  personal  preference.  For  exan^)le,  a  radiologist  might 
choose  a  computer  output  with  high  sensitivity  for  examining  high-risk  patients,  whereas  a  lower 
sensitivity  and  correspondingly  lower  false-positive  ntte  might  be  preferred  for  patients  at  low  risk  for 
cancer.  It  should  be  noted,  however,  that  increasing  the  number  of  interactive  choices  available  to  the 
radiologist  will  lengthen  the  reading  time  per  case.  Therefore,  we  will  investigate  optimization  of  the 
module's  human  interface  by  studying  the  relationship  between  achievable  diagnostic  accuracy  and 
required  reading  time. 

Results  to  date 

The  computerized  image  analysis  software  has  been  v  egrated  into  a  user  ftiendly  interface  based 
on  UNIX,  XWINDOWS  and  Motif  and  operated  on  an  IBM  RISC  6000  Series  570  computer 
workstation.  The  prototype  (hardware  &  software)  was  demonstrated  at  the  1994  annual  meebng  of  the 
Radiological  Society  of  North  America  (RSNA)  and  was  well  received  by  the  many  radiologists  in 
attendance.  Currently,  arrows  (red  for  masses  and  yellow  for  clustered  microcalcifications)  are  used  to 
indicate  the  computer-detected  location  of  lesions.  The  input  to  the  system  can  be  either  a  film  that  is 
digitized  and  then  analysed  automatically  or  a  corrqHiter  file  containing  a  digital  image.  The  prototype 
system  is  interfaced  to  a  Konica  laser  film  digitizer  which  enables  digitization  of  the  mammograms  to 
ai^mximately  2K  by  2K  matrices.  Video  output  of  the  IBM  monitor  is  connected  to  a  low-resolution 
thmnal  [ninter  (apjKoximately  IK  by  IK)  for  hardcopy  rqxnting  of  the  CAD  results.  The  prototype 
system  will  be  transferred  to  the  clinical  reading  area  for  the  next  phase  of  developmrat  and  testing. 
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(3)  Evaliution  procedure  using  large  clinical  databases 

Experimental  methods 

As  described  in  the  previous  section,  the  computer-vision  methods  for  nuunmography  will  be 
developed  in  phases.  Plans  include  testing  the  computer-vision  system  at  the  end  of  each  phase  in  order 
to  (temonstrate  the  effect  of  the  various  modes  of  presentation  on  the  accuracy,  efficiency  and 
acceptability  of  the  mammografdiic  aid.  The  system  will  be  evaluated  using  clinical  mammograms 
obtained  from  both  a  low-risk  pqwlation  and  a  high-risk  pq>ulation.  Tfie  low-risk  population  will  be 
obtained  from  The  University  of  Qiicago  mammogn^hy  screening  program.  The  high-risk  population 
will  be  drawn  from  examinations  referred  to  our  Department  of  Radiology,  since  The  University  of 
Chicago  sores  as  a  tertiary  medical  center.  Initially,  performance  studies  will  be  done  using  a  database 
of  preselected  mammogrs^hic  cases  that  have  a  distribution  of  subtle  cases  of  rKMrmal,  benign  and 
malignant  areas  of  either  masses  or  microcalcifications.  Later  studies  will  be  performed  using  a  more 
representative  database  of  consecutive  mammograj^c  cases  obtained  from  four  weeks  worth  of 
screening.  "Truth"  concerning  the  {xesence  and  malignancy  of  masses  and  microcaicifications  will  be 
established  with  the  aid  of  expert  mammogn^hers,  follow-up  rqx>rts  and  surgical  biopsy  reports. 
Nmmal  cases  will  be  selected  from  patients  who  have  had  normal  follow-up  exams.  Performance 
studies  will  be  done  using  cases  involving  the  four  conventional  mammograms  (left  and  right  CC 
views,  and  left  and  right  MLO  views),  since  these  are  the  usual  images  obtained  in  scteoiing. 

At  the  detection  stage  of  the  crmqniter-visirm  system,  perfrxmance  will  be  examined  by  calculating 
the  fractkm  of  lesions  detected  (true-positive  rate)  and  the  numbo’  of  falsely-rqxxted  areas  pa*  case.  At 
the  classification  sta^  of  the  crmqMttf-visirm  system,  performance  will  be  examined  by  calculating  the 
fraction  of  malignant  cases  correctly  classified  (true-positive  classification  rate)  and  tiie  number  of 
benign  cases  that  ate  tq)oi1ed  by  the  coitqMitar  as  bdng  malignant  (false-positive  classification  rate). 

The  clinical  database  for  these  perfottnanoe  evaluations  will  contain  180  cases  (60  normal,  30  with 
benign  masses,  30  with  malignant  masses,  30  with  benign  microcaicifications,  and  30  with  malignant 
mictocalcificati(Mis). 
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Observer  studies  will  be  performed  to  examine  the  usefulness  of  the  conqxiter-assisted 
interpretation  process  in  enhancing  radiologists'  performance  levels,  as  compared  to  the  unaided 
performance  by  radiologists.  During  phases  1  and  2,  the  database  cases  will  be  printed  with  the 
computer-vision  results  on  each  film.  These  database  cases  will  then  be  used  in  observer  performance 
studies.  Stratified  sampling  (106)  will  be  used  in  choosing  subtle  cases  in  order  to  avoid  problems 
associated  with  either  "too  easy"  or  "too  difficult"  cases  (107).  Twelve  attending  radiologists  and 
senior  residents  will  act  as  observers.  Then,  for  the  180  cases  in  the  database,  three  "reading  methods" 
will  be  tested;  (a)  the  original  cases  without  the  computer- vision  aid,  (b)  the  cases  with  the  detection- 
results  reported  (phase  1  computer  locations  of  suspicious  areas)  and  (c)  the  cases  with  both  the 
detection  and  classification  results  reported  (phase  2  computer  locations  with  probability  of 
malignancy).  Each  observer  will  be  asked  to  perform  two  tasks:  (1)  locate  and  rate  suspicious  areas  as 
to  the  presence  of  an  abnormality  (rating  scale  of  0  to  100)  and  (2)  indicate  an  overall  level  of  certainty 
as  to  the  presence  of  cancer  using  a  S-point  rating  scale  where  l=definitely  benign  and  S=definitely 
malignant  This  five-point  scale  is  the  same  as  that  being  recommended  by  the  American  College  of 
Radiology  for  routine  use  by  clinical  mamriM^r^^rs.  The  dual-task  observer  study  will  allow  for 
evaluation  of  the  utility  of  both  the  conqHiter-visicm  detection  and  classification  results.  (In  addition, 
questiormaires  will  be  given  to  each  obsover  in  order  to  obtain  subjective  information  with  regard  to  the 
efficiency  and  acceptability  of  the  corrqrutcr-vision  mammography  system)  In  die  analysis  of  die 
observer  study  results,  maximum  likelihood  estimation  (108)  will  be  used  to  fit  a  binormal  ROC 
(receiver  operating  charactmsdcs)  curve  to  each  observer’s  confirtence-rating  data  from  each  diagnostic 
method.  The  index  Az,  which  represrats  die  area  under  a  binormal  ROC  curve,  will  be  calculated  for 
each  fitted  curve.  To  rqnesent  the  average  poformance  of  the  observers  for  each  diagnostic  method, 
the  corrqxisite  RC  C  curves  will  be  calculated  by  avnaging  the  slope  and  intercqit  parameters  of  the 
individual  observer-^iecific  ROC  curves.  The  statistical  significance  of  qipaient  diffoences  betweoi 
pairs  of  diagnr^tic  methods  will  then  be  analyzed  flying  a  "two-tailed"  t-test  for  paired  data  to  the 

observer-specific  Az  index  values. 
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Free-response  RCXT  (FR(X)  analysis  (109)  and  FROC-AFROC  analysis  (1 10)  will  be  used  in 
analyzing  the  data  pertaining  to  localization  of  the  abnormality.  The  (xdinates  of  both  FRCX!  curves  and 
AFROC  curves  are  the  fraction  of  lesions  (masses  or  microcalcifications)  that  are  correctly  localized  by 
the  observer.  However,  the  abscissa  of  an  FRCXI!  curve  is  the  average  number  of  false  positives  per 
image,  whereas  the  abscissa  of  an  AFRCXZ  curve  is  the  probability  of  obtaining  a  false-positive  image 
(i.e.,  an  image  containing  one  or  more  false-positive  responses). 

After  phase  3,  another  observer  study  will  be  perfumed  in  which  four  weeks'  worth  of 
maimnographic  cases  will  collected  and  interpreted  by  six  radiologists  with  and  without  the  con^Miter- 
vision  results  of  location  and  classification.  Although  dris  database  lacks  the  control  over  the  subtlety  of 
the  cases  that  the  earlier  mentioned  study  has,  it  rejMesents  a  more  typical  clinical  situation.  Half  of  the 
radiologists  wil!  read  the  first  two  weeks  of  cases  without  aid  and  the  second  two  weeks  of  cases  with 
the  mammographic  aid;  and  the  other  half  of  the  radiologists  will  read  the  first  two  weeks  of  cases  with 
the  aid  and  the  second  two  weeks  of  cases  without  the  aid.  Rating  m^hods  and  analyses  will  be  the 
same  as  mentioned  above. 

Results  to  date 

FROC  analysis  and  R(X!  analysis  has  been  used  extensively  for  the  intermediate  testing  results  of 
the  various  detection  and  classification  methods.  Constant  collection  of  the  database  is  tmgoing. 
Investigators  have  developed  a  case  reporting  sheet  for  organizing  the  new  cases  on  a  Macintosh 
computer  using  FileMakerPro  software.  The  various  databases  being  coUected  include  pathologically- 
proven  mass  and  clustered  microcalcification  cases.  In  addition,  a  "missed  lesitm"  database  is  being 
digitized  in  order  to  test  the  detection  methods  in  the  upcoming  grant  period.  This  database  includes 
lesions  that  were  seen  in  retrospect,  i.e.,  after  the  cancu  was  detected  at  a  later  date.  This  database  will 
demuistrrue  the  ability  of  the  detection  schemes  to  iiKrease  the  sensitivity  of  detectiui  in  a  screening 
program.  In  a  preliminary  study  (i»esented  at  the  RSNA  94)  in  which  26  "missed  lesion"  cases  were 
analyzed,  the  cunputerized  duection  schemes  achieved  a  sensitivity  of  50%.  (Note  that  these  "missed 
lesion"  cases  can  be  thought  of  yielding  a  sensitivity  of  0%  when  they  had  been  read  by  the 
radiologists). 
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CONCLUSIONS 

Substantial  inimvements  in  the  performances  of  the  con^ter-aided  diagnosis  methods  for  the 
detection  of  masses  and  clustered  microcalcifications  have  been  achieved  during  the  past  funding  period. 
For  the  detection  of  masses,  the  sensitivity  increased  hmn  85%  to  94%,  while  tl^  false-positive  rate  per 
image  reduced  from  4  to  2.5  per  image.  For  the  d^ection  of  clustered  microcalcifications,  the  false¬ 
positive  rate  was  reduced  from  2  per  image  to  approximately  0.7  per  image,  without  loss  in  sensitivity. 

Constant  collection  of  the  database  is  ongoing.  Investigators  have  developed  a  case  reporting  sheet 
for  (M^anizing  the  new  cases  on  a  Macintosh  computer  using  FileMakerPro  software.  The  various 
databases  being  collected  include  pathologically-proven  mass  and  clustered  microcalcification  cases. 
Databases  for  both  mammograms  containing  mass  lesions  and  mammograms  containing 
microcalcifications  have  both  increased  in  size  and  some  have  been  digitized  on  more  than  one  digitizer  in 
orda*  to  observe  the  affect  of  digitization  on  detection  performance.  In  addition,  a  "missed  lesion" 
database  is  being  digitized  in  order  to  test  the  detection  methods  in  the  upcoming  grant  period.  This 
database  includes  lesions  that  were  seen  in  retrospect,  i.e.,  after  the  cancer  was  detected  at  a  later  date. 
This  database  demonstrate  die  ability  of  the  detection  schemes  to  increase  the  sensitivity  of  detection 
in  a  screening  program.  In  a  prdiminary  study  (jxesented  at  the  RSNA  94)  in  which  26  "missed  lesion" 
cases  were  analyzed,  die  conqMterized  detection  schemes  achieved  a  sensitivity  of  50%.  (Note  that  these 
"missed  lesion"  cases  can  be  tfaou^t  of  yielding  a  sensitivity  of  0%  when  they  had  been  read  by  the 
radiologists). 

With  regard  to  the  dassificatioa  of  mammographic  lesions  as  an  aid  in  distinguishing  between 
malignant  and  boiign  cases,  die  initial  perfmmances  fix'  both  masses  and  microcalcifications  has  been 
quite  promising.  In  the  classsification  of  masses,  an  Az  (area  under  the  ROC  curve)  of  0.90  was 
obtained  fnxn  die  ROC  analysis  of  the  ouqwt  finxn  the  neural  network,  which  was  used  to  merge  the 
extracted  features  of  the  lesions.  In  die  classificadcm  of  clustered  microcalcifications  a  neural  network 
correcdy  idmitified  47%  of  die  benign  patients,  all  of  whom  had  biqisies,  and  100%  of  the  malignant 
patients.  We  conclude  that  a  crxnputn'  is  capable  of  distinguishing  benign  from  malignant  clustered 
microcalcifications  even  at  100-mm  pixel  size. 
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The  computerized  image  analysis  software  has  been  integrated  into  a  user  friendly  interface  based  on 
UNIX,  XWINDOWS  and  Motif  and  operated  on  an  IBM  RISC  6000  Series  570  computer  workstation. 
The  prototype  (hardware  &  software)  was  demonstrated  at  the  1994  annual  meeting  of  the  Radiological 
Society  of  North  Ammca  (RSN  A)  and  was  well  received  by  the  many  radiologists  in  attendance.  The 
input  to  the  system  can  be  either  a  film  diat  is  digitized  and  then  analysed  automatically  or  a  computer  file 
containing  a  digital  image.  The  {xototype  system  will  be  transfoied  to  the  clinical  reading  area  for  the 
next  phase  of  development  and  testing. 

We  are  very  optimistic  about  the  continuing  success  of  our  research.  We  will  continue  to  improve 
the  detection  and  classification  perfrxmance  of  our  algcxithms.  The  mammographers  in  the  clinical 
reading  area  of  the  department  are  lodcing  fnward  te>  the  placemmt  of  the  prototype.  Weekly  meetings 
ate  held  between  the  basic  science  and  clinical  researches  in  order  to  ensure  a  smooth  integration  of  the 
workstation  in  the  clinical  arena. 
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